Search system with query refinement and search method

ABSTRACT

A search system and method for searching for electronic documents, and providing a search result in response to a search query. In one embodiment, the search system includes a processor, a user interface module adapted to receive a search query from a user that includes a search query term, a search engine that electronically searches for electronic documents that satisfy the search query term, and a query processing module that identifies at least one alternative term for the search query term, and provides the alternative term to the user interface module for display to the user, and also provides the alternative term to the search engine to search for electronic documents that satisfy the alternative term.

This application is a Continuation of U.S. patent application Ser. No.14/192,177, filed Feb. 27, 2014, and entitled SEARCH SYSTEM WITH QUERYREFINEMENT AND SEARCH METHOD, which a Continuation of U.S. patentapplication Ser. No. 11/648,901, filed Jan. 3, 2007, and entitled SEARCHSYSTEM WITH QUERY REFINEMENT AND SEARCH METHOD, and now U.S. Pat. No.8,694,530, which claims priority to U.S. Provisional Application No.60/755,073 filed Jan. 3, 2006, the contents of which are incorporatedherein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention is directed to a search system and search methodthat includes query refinement, and to such a system and methodincluding a user interface for facilitating user refinement of a searchquery.

2. Description of Related Art

Electronic searching across large document corpora using a search engineis one of the most broadly utilized applications on the Internet. Onefundamental search technique used is the keyword-index search thatrevolves around an index of keywords from eligible target items. In thismethod, a user's inputted query is parsed into individual words(optionally being stripped of some inflected endings), whereupon thewords are looked up in the index, which in turn, points to documents oritems indexed by those words. Thus, the potentially intended searchtargets are retrieved. This sort of search service by a search engine isaccessed countless times each day by many millions of computer andInternet users, and is the basis of the Internet search servicesprovided by Lycos®, Yahoo®, and Google®.

Two main problems of keyword searches are (1) missing relevant documents(i.e. low recall), and (2) retrieving irrelevant ones (i.e. lack ofprecision). Most keyword search results obtained by a search engine doplenty of both. In particular, with respect to the first problem, theprimary limitation of keyword searches is that, when viewedsemantically, keyword searches can skip about 80% of the eligibledocuments because, in many instances, at least 80% of the relevantinformation will be indexed in entirely different words than wordsentered in the original query. Granted, for simple searches with verypopular words, and where relevant information is plentiful, this is notmuch of a problem. But for longer queries, and searches where therelevant phrasing is hard to predict, results can be disappointing.

When irrelevant or inadequate search results are retrieved by the searchengines in response to the submitted search query, the user typicallyexpands and refines the search query. This is generally accomplished bymanually deleting, adding, and/or replacing words in the original query,which all requires the user to manually type in such changes into thesearch field of the search engine. This also requires to the user todetermine, look up, or guess, which terms in the original search queryshould be modified or replaced in order to obtain better search resultsfrom the search engine. Of course, further reformulated search queriesmay be required over and over again, in order to have the search engineultimately retrieve the desired documents. This need for reformulatingthe search query is a common source of frustration for users of searchengines who initially thought that the desired documents would beretrieved readily from their initial search query submitted.

The need for reformulating the search query is caused by the fact thatthe desired documents will have content with substantially the samemeaning as the user's entered search query, but which are expressed indifferent grammar or using different vocabulary. Users can recognizeinstantly the relevance of such divergent expressions, but machines suchas computer implemented search engines acting on the basis of matchingkeywords cannot make even the most common-sense determinations ofsameness of meaning, even when the search query terms are enhanced withwildcards, spelling corrections, and stemming.

For example, a travel website containing a large database of hotelinformation that is well-structured, with tens of thousands of items,currently cannot determine that “hotel with activities for kids” is thesame in meaning as “resort offering a recreation program for children”.However, users could make that determination easily and instantly, evenif the user possesses little knowledge of the travel industry. Becauseof the limitations of machines such as computers and software, the userssubmitting the search query have the task of reformulating queries inalternative ways rather than machines.

Therefore, there exists an unfulfilled need for a search system andmethod with query refinement that addresses the above noted limitationsand disadvantages of conventional search engines and search methods. Inparticular, there exists an unfulfilled need for a search system andmethod that reduces the need for user's typing to delete, add, and/orreplace the query words to refine the search query. In addition, thereexists an unfulfilled need for such a search system and method whichutilizes a user interface that supports, and does not detract from, thesearch task.

SUMMARY OF THE INVENTION

In view of the foregoing, an advantage of the present invention is inproviding a search system with query refinement that facilitates theuser in refining the search query.

Another advantage of the present invention is in providing such a searchsystem with a user interface that reduces the need for deleting, adding,and/or replacing the words of the search query by typing.

Still another advantage of the present invention is in providing asearch system with a user interface that supports, and does not detractfrom, the search task.

Another aspect of the present invention is providing a computerimplemented method for searching electronic documents in whichrefinement of the query is facilitated.

Yet another aspect of the present invention is providing a computerreadable media having instructions for providing a search engine thatfacilitates query refinement.

In accordance with one aspect of the present invention, a search systemfor searching for electronic documents and providing a search result inresponse to a search query is provided. In one embodiment, the searchsystem includes a processor, a user interface module adapted to receivea search query from a user that includes at least one search query term,a search engine that electronically searches for electronic documentsthat satisfy the at least one search query term, and a query processingmodule that identifies at least one alternative term for the searchquery term, and provides the alternative term to the user interfacemodule for display to the user, and also provides the alternative termto the search engine to search for electronic documents that satisfy thealternative term. Preferably, the search engine executes a search basedon the search query term and the alternative term simultaneously.

In accordance with one embodiment, the search system further includes aterms database having a plurality of alternative terms for the searchquery term, the query processing module retrieving the alternative termfrom the terms database. The alternative term is a synonym of the searchquery term, and is preferably used in a same sense as the search queryterm is used in the search query. In another embodiment, the userinterface module is adapted to allow the at least one alternative termto be unselected and removed from searching by the search engine. Inaddition, the at least one alternative term is a plurality ofalternative terms, and the query processing module is adapted to provideonly a selected number of the plurality of alternative terms to thesearch engine for searching. Furthermore, the user interface module isadapted to display the selected alternative terms. In accordance withanother embodiment, the user interface module further displaysadditional alternative terms not provided to the search engine forsearching upon user's selection of a link, and to allow user to selectthe displayed additional alternative terms to be provided to the searchengine for searching.

In accordance with one implementation, the user interface module isadapted to display the canonical sense in which the search query term ispresumed to be used. In addition, the user interface module is adaptedto display alternative senses in which the search query term is used,and to allow user selection of the sense. Furthermore, the userinterface module may be implemented to allow user defined sense to beadded to the search system for the search query term.

In accordance with another embodiment, the user interface module isadapted to display domain category in which the search query term ispresumed to be used. In such an implementation, the user interfacemodule is adapted to display alternative domain categories in which thesearch query term is used, and to allow user selection of the domaincategory.

In accordance with still another embodiment, the user interface moduleis adapted to display part-of-speech in which the search query term ispresumed to be used. In such an implementation, the user interfacemodule is adapted to display alternative part-of-speech in which thesearch query term is used, and to allow user selection of thepart-of-speech.

In accordance with another aspect of the present invention, a computerimplemented search method for electronically searching documents, andproviding a search result in response to a search query is provided. Inone embodiment, the search method includes receiving a search query froma user that includes at least one search query term, electronicallyidentifying at least one alternative term for the at least one searchquery term, electronically displaying the identified at least onealternative term for the user, and electronically executing a search forelectronic documents that satisfy at least one of the search query termand alternative term. Preferably, the search is executed based on thesearch query term and the alternative term simultaneously.

In accordance with another embodiment, the alternative term is a synonymof the search query term, and is used in a same sense as the searchquery term is used in the search query. In still another embodiment, thesearch method includes unselecting the alternative term to indicate thatthe alternative term is not to be searched. In yet another embodiment,the at least one alternative term is a plurality of alternative terms,and the search method includes searching for only a selected number ofthe plurality of alternative terms, and displaying the selectedalternative terms for the user. In this regard, the method may includeselecting alternative terms to be searched, and unselecting alternativeterms not to be searched.

In accordance with one example embodiment, the search method includesdisplaying for the user the canonical sense in which the at least onesearch query term is presumed to be used. In one implementation, thesearch method also includes displaying alternative senses in which theat least one search query term is used, and receiving user's selectionof at least one alternative sense. In another implementation, the searchmethod also includes adding a new sense for the at least one searchquery term.

In another example embodiment, the search method further includesdisplaying for the user, a domain category in which the at least onesearch query term is presumed to be used. In addition, the methodincludes displaying alternative domain categories in which the at leastone search query term is used, and receiving user's selection of atleast one alternative domain category.

In yet another example embodiment, the search method includes displayingfor the user, a part-of-speech in which the at least one search queryterm is presumed to be used. In addition, the method includes displayingalternative parts-of-speech in which the at least one search query termis used, and receiving user's selection of at least one of thealternative part-of-speech.

In accordance with still another aspect of the present invention, acomputer readable medium with executable instructions for implementingthe above described system and method is provided.

These and other advantages and features of the present invention willbecome more apparent from the following detailed description of thepreferred embodiments of the present invention when viewed inconjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic illustration of a search system with queryrefinement in accordance with one embodiment of the present invention.

FIG. 2 is an interface screen generated by the search system inaccordance one example implementation.

FIG. 3 shows the interface screen of FIG. 2 where the link “ShowExcluded Words” has been selected by the user.

FIG. 4 illustrates a sense drop down menu for facilitatingdisambiguation of the sense for the terms of the search query.

FIG. 5 illustrates sense scoring provided to the user.

FIG. 6 illustrates an interface screen which allows the user to add asense for a word.

FIG. 7 shows the interface screen of FIG. 6 when adding a proper name.

FIGS. 8A to 8C schematically illustrate time-sequenced instances of adomain category screen in accordance with one example implementation.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

FIG. 1 is a schematic illustration of a search system 10 in accordancewith one embodiment of the present invention. The search system 10receives a user search query and provides a search result, much like theconventional search engines. However, as explained in further detailbelow, the search system 10 includes features for facilitating queryrefinement which proposes alternative search terms to the user so thatthe user can then interact with the search system 10 to indicateappropriate alternative search terms that can be used to refine thesearch query, and retrieve the desired documents as search results.Moreover, the search system 10 also includes features for providing wordsense information and alternative sense information, and a userinterface that allows the user to change the word sense.

It should initially be understood that the search system 10 of FIG. 1may be implemented with any type of hardware and/or software, and may bea pre-programmed general purpose computing device. For example, thesearch system 10 may be implemented using a server, a personal computer,a portable computer, a thin client, or any suitable device or devices.The search system 10 and/or components thereof may be a single device ata single location, or multiple devices at a single, or multiple,locations that are connected together using any appropriatecommunication protocols over any communication medium such as electriccable, fiber optic cable, or in a wireless manner.

It should also be noted that the search system 10 in accordance with thepresent invention is illustrated and discussed herein as having aplurality of modules which perform particular functions. It should beunderstood that these modules are merely schematically illustrated basedon their function for clarity purposes only, and do not necessaryrepresent specific hardware or software. In this regard, these modulesmay be hardware and/or software implemented to substantially perform theparticular functions discussed. Moreover, the modules may be combinedtogether within the search system 10, or divided into additional modulesbased on the particular function desired. Thus, the present invention,as schematically embodied in FIG. 1, should not be construed to limitthe search system 10 of the present invention, but merely be understoodto illustrate one example implementation thereof.

Referring again to the illustrated embodiment of FIG. 1, the searchsystem 10 includes a processor 12 that is electronically connected to auser interface module 16, a search engine 20, and a query processingmodule 30, these modules being electronically interconnected by theirconnection to the processor 12 in the illustration of FIG. 1. The searchsystem 10 also includes a refinement words database 40 which iselectronically connected to the query processing module 30, and arecords database 50 which is electronically connected to the processor12. The functions of the various modules and components of the searchsystem 10 are controlled by the processor 12, these functions beingdescribed in further detail below.

In addition, the search system 10 is further electronically connected toa documents database 22 via a network 2, the documents database 22having a plurality of electronic documents 24 stored therein. It shouldbe understood that as used herein, the term “electronic document” refersto any computer readable file, regardless of format. For instance, webpages of websites, word processing documents, presentation documents,spreadsheet documents, PDF documents, etc., are all examples ofelectronic documents referred to herein. It should also be evident thatthe documents database 22 illustrated is remotely located, and is incommunication with the search system 10 via network 2 which may be theInternet. Of course, the documents database 22 is merely representativeand a plurality of databases may be connected to the network 2 andaccessed by the search system 10.

The user interface module 16 is adapted to receive a search query suchas one or more search terms or a phrase from a user 6 of the searchsystem 10 via a terminal 4 that is also connected to the network 2 suchas the Internet. The search query is indicative of the content ofdocuments that are of interest to the user 6. The user interface module16 provides the search query to the query processing module 30 whichprocesses the search query in the manner described in detail below, sothat the search query can be provided to the search engine 20. Thesearch engine 20 is adapted to search the documents database 22 toidentify those documents from the plurality of electronic documents 24that satisfy the search query that has been processed by the queryprocessing module 30. Of course, searching of the documents database 22by the search engine 20 may be attained in any appropriate manner, andcan be facilitated by utilization of word indexes as already known inthe art. Those electronic documents 24 which satisfy the search queryare identified for the user 6 in a search results page generated on aninterface screen on the terminal 4.

In the above regard, in accordance with the present embodiment, thequery processing module 30 analyzes the terms or phrases of the searchquery received by the user interface module 16, and identifies andretrieves from the terms database 40, those terms that can be proposedto the user 6 for refining the search query which will enhance thelikelihood that relevant documents are retrieved from the documentsdatabase 22 by the search engine 20. The alternative search terms thatare stored in the term database 40 may be synonyms to the search queryterms and be obtained from reference sources such as WordNet orelectronic thesauri. Thus, the query processing module 30 of the searchsystem 10 functions to identify those terms in the terms database 40which are most likely to increase recall of the search results, whilealso preserving precision and relevancy of the documents retrieved. Inone embodiment, this processing by the query processing module 30 may beperformed before or after the actual search is performed by the searchengine 20. However, in the preferred embodiment, this processing isperformed simultaneously while the conventional search is performed bythe search engine 20, and a selected number of these alternative termsidentified by the query processing module are simultaneously searched bythe search engine 20.

The advantage of the search system 10 in accordance with the presentinvention is apparent by the following example. Consider the apparentlysimple search query “kid pool”. Most people will instantly understandthat this query is about small swimming pools designed for children, andthat “children's swimming pool” or “toddler's wading pool” are equallyas relevant as the words of the search query. Now supposing thedocuments retrieved by the search engine in response to the search querydoes not include the document or information for a particular kiddy poolthe user had in mind, the user would be in the all-too-familiarsituation of coming up with alternative query strings or terms that mustbe resubmitted to the search engine. Rather than force the user to comeup with them all from scratch, the query processing module 30 of thepresent invention provides suggestions to the user 6 that can be easilyselected via the user interface 18 for incorporation into the searchquery.

For example, the search system 10 of the present invention can informthe user 6 that the term “children” is an alternative to “kid”, andinclude such term in the search by the search engine 20 as well. Thesearch system 10 can then offer to the user, the chance to de-select“children” via a user interface displayed by the user interface module16 so as to remove this term for consideration if desired so that thesearch engine 20 can dynamically modify the search query being searchedfor. In addition, other less likely to be related terms such as“toddler” can be proposed to the user 6 if the user 6 indicates theirdesire to consider them (for example, by providing a link to such terms)so that they can be selected via the user interface for inclusion in thesearch by the search engine 20, preferably with single click, ratherthan having to type the term “toddler”.

The providing of alternative search terms in response to a typed searchquery is not a simple task. In particular, the provision of alternativesearch terms may be difficult due to the number of synonyms for thewords and the numerous different canonical senses in which words can beused. In the above noted example, there are more than 140 equivalentterms in an electronic thesaurus for the terms “kid” and “pool”combined. Automatically submitting all of such terms to the searchengine 20 and/or displaying a list of 140 possible synonyms (andhypernyms and hyponyms) to a user 6 for a simple two-word query is notreally a tenable option and is not useful to the user 6 submitting thesearch query. Such submission of all terms will likely result inreferences directed to subject that is clearly not related to the user'sintended subject, and such an exhaustive presentation of alternativeproposed search terms would severely detract from the search task.

In view of the above, the search system 10 in accordance with thepreferred embodiment of the present invention only automatically submitsselected number of alternative terms to the search engine 20, and onlydisplays selected number of alternative terms for consideration by theuser. Preferably, only a predefined number of the most pertinentalternative terms, or that which has the highest probability of beingrelevant which are automatically submitted to the search engine 20, aredisplayed for the user, without submitting or showing all possiblealternative terms (all 140 possible terms in the case of the searchquery “kid pool”).

In the preferred implementation, the alternative terms stored in therefinement words database 40 may be organized in clusters of words or“wordsets” that are grouped together based on having similar usagepatterns or belonging to the same canonical word sense. The canonicalsenses of the search query terms and the alternative terms may beobtained from reference sources such as WordNet and stored in the termsdatabase 40 and/or records database 50, or alternatively, retrieved asneed from a reference source. The query processing module 30 can thenidentify and provide to the user interface module 16, the terms of thecluster that is deemed most likely to be relevant and for the correctcanonical sense of the terms in the search query. Since even a singlewordset may have too many words within it, a further collapsing can beperformed by selecting just the most relevant words within the canonicalsense of the query search term, and showing only select number of wordslikely to be the most relevant for automatic submission to the searchengine 20 and display to the user 6, while hiding the rest of the wordsfor optional display to the user for selection and submission to thesearch engine 20.

One system and method for determining which alternative terms are to beselected and initially proposed to the user and submitted to the searchengine 20 is described in detail in patent application Ser. No.11/401,448 filed Apr. 11, 2006, the contents of which are incorporatedherein by reference. The system and method described in the '448 patentapplication utilizes prior probability scores for candidate synonymterms, and substitutes candidate synonym terms in order to determinethose terms that are likely to be the most relevant. Of course, othersystems, means and methods may be used for determining which alternativeterms are to be selected and proposed, and the present invention is notlimited to the specific implementation described in the '488application. For example, the alternative terms may be stored in theterms database 40 with simple identifiers that indicates which terms areto be suggested as an alternative to a particular search query term.

The alternative terms are retrieved by the query processing module 30and those indicated to be likely to be the most relevant to the searchquery are provided to search engine 10 and the user interface module 16.These terms are displayed on an interface screen for the user 6 via theterminal 4 by the user interface module 16, thereby informing the user 6of the additional proposed terms that are being searched for by thesearch system 10 in response to the user's entered search query. Theuser interface module 16 allows the user 6 to de-select any of theproposed terms automatically submitted to the search engine 20 so as todynamically reduce the number of terms the search engine 20 searchesfor. In addition, the user interface 16 further provides additionalfeatures for adding more alternative proposed terms or furtherdisambiguating the senses for the terms used in the search query, asdescribed in further detail below.

FIG. 2 shows an interface screen 100 that is generated by the userinterface module 16 of the search system 10 in accordance one exampleimplementation. As can be seen, the interface screen includes a searchfield 102 for entry of a search query by a user 6, in this case, “kidpool”. The search process is initiated upon clicking on the submitbutton “Dig” 104. It is noted that the search field 102 in theillustrated implementation is merely provided with a single line forentry of the search query. However, for many search queries, the queryprocessing module 30 can suggest better quality alternative terms if theuser 6 enters a long enough query for there to be sufficient contextualinformation to allow topic and domain category selection to be performedaccurately for the search. Unfortunately, users of the Internet areaccustomed to entering fairly short queries which are usually two orthree words. To encourage a different behavior, the search field 102 maybe implemented by the user interface module 16 with multiple linesinstead of the single line shown in the embodiment of FIG. 2. In thisregard, if user types in less than a predetermined number of words intothe search field 102, for example, less than five words, they can beprompted to enter more words by the user interface module 16 in otherimplementations.

In the illustrated embodiment of the search system 10, upon selection ofthe submit button 104, the search engine 20 identifies the documentsthat are relevant to the search query by examining the search index ofthe documents database 22 or by examination of the documents themselves.The results are preferably dynamically displayed in a search resultsfield 106, only one result document being shown as an example, but manydocuments would be displayed in a listing as they are found. A statusfield 108 is also provided on the interface screen 100 by the userinterface module 16 which displays the total number of relevant “hits”found by the search engine 20, and the number of hits being displayed.This feature of the search engine 20 and the user interface module 16may be attained in any appropriate manner, including those used bypresently available search engines. In addition, the interface screen100 also displays other information related to query refinementperformed by the query processing module 30 as discussed herein below.

The interface screen 100 of the illustrated embodiment which isgenerated by the user interface module 16 is also provided with a queryrefinement section 110 that facilitates the user 6 in refining thesearch terms that were submitted to the search engine 20. In thisregard, the interface screen 100 displays numerous alternative terms 112for each of the search terms, together with a selection indicator (inthe form of a check box in the illustrated implementation) to indicatethat the search term user's 6 search query has been expanded to includethe indicated alternative terms 112.

In other words, the query processing module 30 automatically submits tothe search engine 20, the identified alternative terms retrieved fromthe terms database 40 that are most likely to be relevant to the user'ssubmitted search query, for simultaneous search with the originallysubmitted search terms. In addition, the user interface module 16displays these alternative proposed terms to the user 6 in the queryrefinement section 110 of the interface screen 100, thereby informingthe user that the user's search query has been expanded to include theindicated terms. Again, it should be noted that the query refinementsection 110 merely displays to the user 6, select number of the likelypertinent alternative terms for the search terms out of the many thatmay be possible, so as to not detract from the search task.

In the above regard, explanatory text is provided in the queryrefinement section 110 that informs the user 6 that the search queryterm is being interpreted in a certain manner by the search system 10.Thus, as shown in FIG. 2, the query refinement section 110 states “By‘kid’ we assume you meant”, and lists the alternative terms 112, such asbabe, infant, child, etc., which were submitted to the search engine 20for searching as indicated by their corresponding selection status inthe check boxes. The user 6 can de-select any of the check boxes toremove particular terms from being searched for by the search engine 20.Similar text with corresponding alternative terms submitted is providedfor the search query term “pool” as shown.

It should be noted that different number of alternative terms areproposed for the search query terms “kid” and “pool”. This occursbecause the term “kid” has more terms that are likely to be highlyrelevant, whereas the term “pool” has fewer such terms, only two termsbeing deemed sufficient for automatic submission to the search engine20. Thus, the number of terms to be submitted and displayed may be basedon the relevance of the proposed terms to the search query term. Ofcourse, a fixed, predetermined number of alternative terms may be usedas a basis for determining how many alternative terms are displayed andsubmitted in other embodiments.

In addition, a link “Show Excluded Words” 114 is provided which, uponclicking thereon, the user interface module 16 displays the remainingalternative words in the interface screen 100 as described in furtherdetail below. Moreover, alternative senses of the search words aredisplayed in the sense field 116 of the query refinement section 110 inthe illustrated embodiment. In the present implementation, the user 6 isinformed of the alternative senses by the text “Did you mean instead:”with other terms that correspond to an alternative sense of the searchquery terms. Thus, in this case, the user 6 is informed by the searchsystem 10 that the search term “kid” was interpreted to mean a veryyoung baby by the query processing module 30. However, an alternativesense of the same word refers to a child that is older who may also bereferred to as “youngster, kiddy, toddler”. As shown, the alternativeterms in which the search query term “kid” corresponds to this olderchild sense is rendered as a link in the sense field 116 of therefinement section 110.

Furthermore, additional terms that correspond to still other alternativecanonical senses for the search query term may be displayed by the user6 by the selection of the link “Show all” 118. For example, uponselection of the link “Show all” 118, a link with alternative terms“joking, kidding” may be also displayed which informs the user 6 of thisalternative canonical sense of the search query term “kid”. Moreover, inother embodiments, the user may be allowed to selectively combine one ormore of these alternative senses to broaden the search results insteadof merely selecting one sense. This may be attained, for example, bylisting the different senses, and providing a check box to allow theuser to select the desired senses. However, including all of the senseswould diminish the ability of the search system 10 to retrieve onlythose documents relevant to the user's intended search query.

FIG. 3 shows the interface screen 100 of FIG. 2 in which the link “ShowExcluded Words” 114 has been selected by the user 6 to displayadditional proposed terms which were previously hidden in FIG. 2. As canbe seen, expanded alternative terms “kindergartner” and “preschooler”have been selected by the user 6 as indicated by their correspondingcheck boxes. As can also be seen, the non-selected (excluded) terms aregrayed-out in the illustrated implementation to allow the user 6 toquickly peruse and identify those terms that are being searched for bythe search system 10. Thus, the search system 10 in accordance with thepresent invention may be implemented to show the user 6 which of thewords or wordsets should be included as query refinement terms, andwhich terms can be optionally selected for inclusion in a refined searchquery, without requiring the user to type in such refining terms.

In accordance with another embodiment, the search engine 20 of thesearch system 10 may be implemented so that the search query termssubmitted by the user 6, such as the word “pool” in the example searchquery of FIGS. 2 and 3, must be found in the document for retrieval as asearch result, whereas the alternative terms automatically submitted orselected for submission by the user 6 (by indicating, for example, bycheck box), need not be found. Finding of such alternative termshowever, may be used to increase the relevance of the documentcontaining them. In other words, referring to the example search query“kid pool” of FIGS. 2 and 3, the search system 10 may be implemented sothat the word “kid” must be found in the document required for thedocument to be considered as a result for the search, but the term“toddler” merely adds points to the relevancy of the document thatcontains this term, and is not required to be found. Of course in otherimplementations, other strategies may be implemented by the searchsystem 10. For example, another embodiment of the search system 10 maybe implemented so that the alternative terms selected by the user mustbe found in the document. Moreover, the search system 10 may beimplemented so that such features are user selectable.

FIG. 4 illustrates an interface screen 200 in accordance with anotherexample implementation of the present invention including a sense dropdown menu 220 in the refinement section 210 for facilitatingdisambiguation of the sense of the terms of the search query. As can beseen, the search query submitted by the user 6 included the term“magic”, which is interpreted by the query processing module 30 to beused in a canonical sense of the practice or art which invokessupernatural powers. Correspondingly, this sense is displayed in sensefield 214 of the refinement section 210, which in the illustratedsearch, reads “possessing or using or characteristic of or appropriateto supernatural powers”. In addition, the query processing module 30proposes alternative terms such as “magical”, “sorcerous” and “wizardry”from the term database 40 based on this canonical sense of the word“magic”.

Of course, the term “magic” can be used in different senses as well.Thus, in accordance with the illustrated implementation, the sense dropdown menu 220 displays “Did you mean instead . . . ” encouraging theuser to take a look at the other senses of the search term that can beused in proposing alternative terms. In particular, alternative senseselections “any art that invokes supernatural powers” and “an illusoryfeat” are displayed upon selection of the sense drop down menu 220,thereby allowing the user 6 to change the sense of the word. Aspreviously noted, such word sense information may be obtained fromreference sources such as WordNet.

Changing of the sense of the word (to “an illusory feat” for example)causes the query processing module 30 to propose different alternativeterms such as “illusion” and “trick” from the refinement database 40.These alternative terms are then displayed in the refinement section210. Of course, the use of a drop down menu interface as a means forindicating the word sense is merely provided as one example, and otherinterfaces may be used such as listing with check boxes or links, etc.However, the use of a drop down menu interface ensures that thedisplayed refinement section 210 is simple and easy to understand by theuser 6 so as to support the search task, while not detracting from thesearch task.

Furthermore, the sense of the search term which is selected by the user6, as well as various other actions by the user 6, are preferablyrecorded by the search system 10 in records database 50, whether thesense is as originally understood by the query processing module 30, oris specifically selected by the user using the sense drop down menu 220.By merely selecting senses from the drop down menu 220, a userimplicitly votes for the selected sense as being more dominant thananother sense. By keeping track of the number of users that indicate aparticular sense for a particular word in the records database 50, thesearch system 10 can determine which sense a search query is probablybeing used, and which alternative terms the query processing module 30should initially present to the user 6 when the same or similar searchquery is received from a different user.

Of course, instead of the user interface provided above, other means ofdisambiguating the sense of the words of the search query may be used inother implementations of the present invention. For instance,alternative embodiments of the interface screen 100 shown in FIG. 2,verbs, prepositions, nouns and adjectives entered in the search field102 can be probabilistically identified within the search query by thequery processing module 30 by using any statistical part-of-speechtagger, several being available in the art. This information may be usedby the query processing module 30 and displayed in the query refinementsection 110 of the interface screen 100 generated by the user interfacemodule 16. In addition, the query processing module 30 may furtherpropose alternative parts-of-speech in the query refinement section 110of the interface screen 100, for example:

-   -   “By ‘kid’ we assumed you meant the noun (toddler', ‘child).    -   Did you mean the verb (‘joking’, ‘just kidding’)?”

In still another alternative embodiment, the general domain category ofdiscourse, or topic of the query, as guessed through an automatedapparatus, e.g. sports, leisure, business, law, history, etc., can beutilized to bias which alternative terms are selected for expansion ofthe search query and displayed to the user. In this regard, this domaincategory selection itself can be displayed to the user to allow the userto change it, or to select no particular domain category at all. Thus,the drop down menu for the search term “magic” may instead, includeselectable entries such as “entertainment”, “religion” and “culture”corresponding to possible domains of the senses in which the search term“magic” is used. However, such general domains may not be readilyapparent to the user. Therefore, the previously described drop down menuimplementation in which the senses of the word are displayed provides awell balanced approach to disambiguating the senses of the words of theentered search query which supports the search task, but does notdetract from the search task.

In addition, the interface screen 200 of FIG. 4 further includes a sensescoring link “Score These Senses” 230, and a sense adding link “AddSense” 240. FIG. 5 shows the interface screen 200 upon selection of thesense scoring link “Score These Senses” 230. As can be seen in the dropdown sense rating field 234, the senses associated with the search term“magic” are listed, and a rating (number of stars in the illustratedimplementation) is provided for each of the senses based on previousselections by other users.

The interface screen 200 is preferably implemented by the user interfacemodule 16 to allow the user to rate the displayed senses, for example,by selection of one of the senses listed in the sense rating field 234.The user's ratings can then be used by the search system 10 tocorrespondingly adjust the global rating of these senses for theparticular search query term. Preferably, the user's ratings are used tobias or weight the predicted sense for search queries that overlappartly, or completely, with the search query that was submitted by theuser, or otherwise modified by the user. Moreover, such user's ratingsare stored in the records database 50 of the search system 10 to adjustthe global ratings for these senses for subsequently executed searchesfor the same or substantially similar search queries.

FIG. 6 shows an add sense screen 300 that is displayed on the interfacescreen 200 by the user interface module 16 upon selection of the “AddSense” link 240. This feature is illustrated as being used for adding asense for the particular search word, in this example, the search word“magic” which is indicated at the top of the add sense screen 300. Sucha feature can be used when the lexicon contains senses of the searchterm that is entirely different than those identified by the searchsystem 10. For example, the search term “magic” shown and describedabove to FIGS. 4 and 5 pertains to supernatural powers and to theperformance of illusions. However, it may also be used in the sense inwhich we say “it was a magic moment when my baby took her first steps.”In such a case, user 6 could add this sense for the search term usingthe add sense screen 300.

In the above regard, the add sense screen 300 allows the user toindicate whether the word is a proper name, noun, verb, adjective, or anadverb in part-of-speech section 302. The vocabulary type selection 304allows the user to identify whether the word is an ordinary word,technical word, slang word or a vulgar word. In addition, the add sensescreen 300 further provides a user input field that 306 for the user toenter a brief description, synonyms/equivalents, examples, andcontextually related words. These contextually related words arevaluable for identifying the usage of this particular sense by laterusers and/or in source documents. The entered information can be addedby selecting the “OK” button 308 which stores the added information inthe terms database 40 and/or records database 50 of the search system10, thereby adding the new sense for the word “magic” that is retrievedin subsequent searches for the same query term. Of course, the add sensescreen 300 may be implemented to facilitate addition of multiple sensessuccessively until all desired senses for the particular search term areadded.

In the illustrated embodiment of FIG. 6, if the user selects “ProperName” in part-of speech section 302, the add sense screen is slightlymodified to the display sense screen 320 shown in FIG. 7. The vocabularytype selection 324 allows the user to identify whether the name is for aman, woman, place, organization, product, or other name that can beentered in field 325. This is a rudimentary ontological classification,but nonetheless valuable in helping to disambiguate what kind of entityis intended. It is an important compromise between demanding a formalontological specification from the user 6, and asking for none at all.

In addition, the add sense screen 320 further provides a user inputfield that 326 allows the user to enter a brief description, aliases,and contextually related words for the proper name. As can be seen, inthe illustrated example, the word “magic” as used in the sense for“Magic” Johnson, the famous basketball player, is entered by the user.The entered information can be added by selecting the “OK” button 308which stores the added information in the search system 10, therebyadding the new sense for the word “magic” that is retrieved insubsequent searches for the same search query.

Referring again to the general example implementation of the interfacescreen 100 shown FIG. 2, entry into the search query field 102 of wordsthat are used in proper names may be identified by the query processingmodule 30 using any number of Named Entity Extraction tools that areavailable in the art, and the interface screen 100 modified accordingly.For example, if the search query entered contains the words “tigerwoods”, the query refinement section 110 of the interface screen 100 maybe implemented to display:

-   -   “By ‘tiger woods’ we assumed you meant the sports celebrity.    -   Did you mean instead ‘tiger’ and ‘woods’ as regular words and        not as a name?”

When there is ambiguity about a proper name, the query processing module30 automatically selects the one most likely to be intended by the user,based on any other search query terms, or based on prior ratings byother users. However, the user interface module 16 of the search system10 also informs the user of the search system's 10 assumption andpresents the user, the option to modify the search system's 10assumption. For example, there is a famous baseball coach and also afamous actor who are both named Dean Martin. There are fans of bothcelebrities who are actually unaware that the other one exists, and so,would not even know that the name is ambiguous. If the query submittedto the search system 10 by the user 6 is “Dean Martin playing a baseballplayer”, it may be ambiguous to the query processing module 30 whetherthe user is talking about the actor portraying an athlete, or the coachdeciding to utilize an athlete. In such instances, the search system 10in accordance with the present invention may be implemented to displayin the query refinement section 110:

-   -   “By ‘Dean Martin’ we assumed you meant the famous actor.    -   Did you mean instead the famous baseball coach?”

Optionally, since a highly disjunctive search which incorporates severalalternative terms can take much more time and resources to perform bythe search engine 20 than a standard query, the search system 10 may beimplemented so that the user interface module 16 displays progressiveresults of the search on the interface screen 100 as the search is beingperformed. This may be implemented as a time-sequenced display injavascript of the domain categories and clusters that are constrainingthe search, i.e. showing the user what domain categories and clustersthe search engine 20 is searching for the results. This will makewaiting for search results more tolerable and possibly even fun for theuser 6, and provides the user the opportunity to intervene before thesearching is complete.

For example, the search system 10 in accordance with the presentinvention may be implemented so that the user interface module 16displays on the query refinement section 110 of the interface screen100:

-   -   “For your query ‘Dean Martin Bio’ we are searching the        categories:    -   [√] Movies and TV [√] Sports [√] Academia.”

In the above regard, FIGS. 8A to 8C schematically illustrate a domaincategory screen 400 that may be rendered in the status field 108, or inanother area of the interface screen 100. However, the domain categoryscreen 400 is preferably implemented in javascript, and displayed in aseparate window. As can be seen, FIGS. 8A to 8C show three separate,time-sequenced instances (after 2, 5, and 9 seconds) of the domaincategory screen 400 that indicate the domain categories that areconstraining the search being executed by the search engine 20. Thesedomain categories are separated into different columns of the domaincategory screen 400 in the illustrated embodiment to facilitate easycomprehension by the user 6.

Thus, FIG. 8A indicates that the search query “Dean Martin Bio” is nowbeing searched in the domain category of Movies & TV, that context ofthe search is “rat pack”, “comedy” and “singer”, and after two secondsof search, 23 hits have been identified thus far. FIG. 8B shows that thesearch query “Dean Martin Bio” is now being searched in the domaincategory of Sports, that context of the search is “baseball”, “coach”and “playoffs”, and after five seconds of total search, 8 hits have beenidentified, in addition to 37 hits in the prior domain category ofMovies & TV. FIG. 8C indicates that the search query “Dean Martin Bio”is now being searched in the domain category of Academia, that contextof the search is “Administration”, “Faculty” and “Administration”, andafter 9 seconds of search, 34 hits have been identified, in addition to37 hits in the prior domain category of Movies & TV, and 14 hits in theprior domain category of Sports.

The above information is preferably presented immediately on the queryrefinement section 110 of the interface screen 100, before the searchresults are fully displayed on the interface screen, so that the user 6can interrupt the search before it is complete, for example, byunchecking one of the check boxes in domain categories or clusters thatare irrelevant. Thus, if the user 6 was looking for informationregarding Dean Martin, the actor, the user 6 can unselect “Sports”domain category which would substantially reduce the number of relevantdocuments, and the resources required by the search engine 20 of thesearch system 10.

In accordance with the preferred embodiment of the search system 10, thesearch results that are displayed dynamically for the user 6 in thesearch results field 106 as the search engine 20 progresses in thesearch. Furthermore, additional information may be displayed as thesearch system 10 is used, such as displaying estimated volume of hits aswords/senses are selected, real-time suggestions (e.g. one user enters anew word and the search system 10 adds it to another user's suggestionlist in real time), and other statistics.

The search system 10 may be implemented to also display completevariations of a search query, for example, in the status field 108,which can also be useful to the user in refining the search. Inaddition, the hit counts which are displayed in the status field 108 maybe further detailed for each suggested alternative phrase, terms,senses, etc. In particular, even in a static form, it may be useful tothe user 6 to see how many hits resulted from each of the suggestedalternative phrases, terms, senses, etc. Such information provides theuser 6 with clues into the potential value or return of a particularselection of suggested alternative phrases, terms, senses, etc.

If, for example, there are very many variations for saying “kid pool”, apredetermined number of variations in order of hit volume or popularitymay be displayed in the status field 108 of the user interface screen100, and alternatively, could be displayed with an option to view all ofsuch variations. Thus, for the shown search query “kid pool”, thefollowing top five alternatives and the corresponding number of hits maybe displayed in the status field 108 of the interface screen 100:

-   -   “children's swimming pool” (1,450,000)    -   “toddler's swimming pool” (1,180,000)    -   “kiddie pool” (908,000)    -   “children's wading pool” (597,000)    -   “kiddy pool” (318,000).

As noted above, during execution of a search by the search engine 20,various information such as the search terms being processed,alternative terms being considered, etc. may be displayed for the user 6by the user interface module 16. This information provides feedback tothe user 6 and something to occupy the user while waiting for therelevant documents to be identified in the search results field 106,thereby providing a distraction which would make the wait seem shorterthan it is, and further providing a sense that progress is being made.In addition, such information provided to the user 6 educates the user 6as to what the search system 10 is actually doing, laying out the stepsof query refinement and search execution, and also allowing the user tointervene, if desired.

The noted information may be displayed in many ways. For example, suchinformation may be displayed in the user interface screen where theterms are already being shown to the user (e.g. the terms, variations,etc. appear and disappear as analysis progresses) using the existingfonts/colors or in a grayed/lighter color. Alternatively, suchinformation may be displayed in a separate window in front of the userinterface screen, similar to a customary progress dialog, with orwithout a progress bar or other animation such as an hourglass orrotating pie/clock graphic.

It should also be noted that alternate parses of search query terms andcollocated words may also be addressed in one embodiment of the presentinvention. Collocated words are words or terms that form a singlelexical unit that has a particular meaning that generally cannot bedetermined by looking at the individual terms separately, such as “pingpong” or “operating system”. Part of the process of disambiguating whatis intended by the terms of a user's query, even before selecting thecorrect sense of each word, includes determining what constitutes eachlexical unit in the user's query. This can be difficult, especially in anon-formal environment such as user entered queries, where it isexpected that many forms of a collocated word may be entered, even ifnot technically correct (e.g. “pingpong”, “ping-pong” instead of “pingpong”). User 6 would expect similar search results to be returned by thesearch system 10 from a query containing a term of the form“commonsense”, “common-sense”, or “common sense”.

The query processing module 30 of the search system 10 in accordancewith the preferred embodiment is implemented to automatically select theparse that is deemed most likely from the user's search query. If acollocation is selected, it is presented to the user 6 via the userinterface module 16 as if the it were a single semantic element of thesearch query, allowing the user to select a different sense, equivalentwords, etc. in the manner described above. In the event that the parseis incorrect, the user interface module 16 may be implemented to alsoallow the user to alter the parse by way of adding additional senses,for example, in the manner also described.

For example, consider the query “operating system”. By default, thesearch system 10 in accordance with the preferred embodiment isimplemented to consider this query to contain a single word, “operatingsystem” which is understood to be a single semantic unit. In addition tothe senses of the collocated word, another sense can be added:[“operating” and “system”]. This indicates that the user meant the wordsas individual lexical units, which can have vastly different meaningthan the collocation “operating system”. Note that by selecting thisdifferent sense, the user is indicating that they are not looking for“operating system”, as in computer software, and this fact can be usefulto demoting search results that appear to be using “operating system” asa collocation in the computer software sense. Similarly, if the searchsystem 10 automatically selects the parse with individual words, it ispreferably implemented to also offer the collocation as an alternateparse that may be selected by the user.

Further to the above discussion relative to collocations, the occasionalcase of overlapping collocations is also appropriately handled by thesearch system 10. Consider the search query “operating system error”.There are three possible parses:

-   -   [“operating system” “error”];    -   [“operating” “system error”]; and    -   [“operating” “system” “error”].

In response to such a search query, the query processing module 30 ofthe search system 10 may be implemented to automatically select thefirst parse. If this is not what the user intended, the user interfacemodule 16 of the above described implementation offers two ways tochange the parse selection. In this regard, a drop down menu similar tothat shown in the refinement section 210 of the interface screen 200 ofFIG. 4 may be implemented to display the search terms “operatingsystem”, and in the drop down menu selections, to list the virtualsenses of [“operating” “system error”] and [“operating” “system”],thereby allowing the user to select the desired sense. In addition, forthe search term “error”, the drop down menu may be provided which lists[“system error“] as an alternative sense. In other words, the userinterface module 16 may be implemented to provide alternative ways ofcombining or splitting the search query terms. When an alternate parseis selected by the user, the user interface module 16 automaticallysplits pre-existing collocations as necessary, and preserves all prioruser selections that apply to terms of the search query that areunaffected by the change to the parse.

It should be noted that in the preferred implementation of the presentinvention, the sense selection drop down menu 220 is re-purposed toaccommodate the parsing selection in such instances, rather thanproviding another user interface element in the interface screen 200 toaccommodate parsing selection. In other words, it is preferred to havethe sense drop down menu 220 also perform the function of allowing parseselection just to display and allow tweaking of the various possiblecombinations/splittings of the terms of the search query, rather thanproviding a separate drop down menu, although such a separate menu maybe implemented in other embodiments. The adding of an entirely new userinterface element to the interface screen just to accommodate suchparsing, which is a relatively rare need amongst the greater number ofall use-cases, would be quite burdensome to most users, and woulddetract from the search task in most instances. In this regard, mostusers would consider it natural to understand a search query for twoindependent terms “operating” and “system”, to be a different sense ofthe phrase “operating system”.

In any implementations of the present invention described above, it isdesirable for the search system 10 to keep a record of how the user 6modifies a search being performed by the search engine 20 using theabove described user interface and information displayed thereon. Theseinformational records may be stored in the terms database 40 and/orrecords database 50 of the search system 10. Such records can thenpreferably used to improve the accuracy of the query processing module30 in analyzing the received search query so that it can suggest betteralternative search terms, phrases and senses in subsequently submittedsearch queries for the same, or similar, search terms.

Thus, the user's 6 selection by clicking on one or more links to thedocuments retrieved by the search engine 20 is preferably recorded bythe search system 10, including which document was selected, and withwhat time-stamp relative to session start. If the selected document wasretrieved based on its relatedness to a particular alternative searchterm that was suggested by the query processing module 30, suchinformation can be then used by the query processing module 30 toincrease the likelihood that the particular alternative search term beautomatically suggested in the future for the same, or similar, searchquery.

In addition, information may be collected and stored with respect to thedocuments identified and provided as search results by the search system10. In this regard, a simple rating system (for example, by the use ofstars, thumbs-up/thumbs-down, a positive or negative number, yes or noas to relevance, etc.) may be employed by the user interface module 16to collect the user's rating of each document identified as a searchresult in response to the search query. The ratings that are collectedmay be stored in the records database 50, together with informationregarding the date of the search and the rating action.

As noted above, the senses of the search query terms, as selected by thequery processing module 30 and/or the user 6, as well as the alternativeterms equivalent words, are preferably recorded and stored in therecords database 50 and/or the terms database 40. This information isimportant for utilizing the rating information, as opposed to the rawsearch query, and allows for greater re-use of the user ratings sincethe surface lexical units of the search query are reduced to morefundamental semantic units. The surface grammar of the search query mayalso be reduced to a canonical form in order to facilitate matchingbetween search queries of different users that are equivalent. Forexample, the search queries “kid pool” and “pool for children” would beconsidered to be equivalent if the sense selections by the userssubmitting the search queries are the same. Furthermore, additionalinformation such as the username of the user providing the ratings ofthe documents identified in the search results can be recorded andstored in the records database 50 so that the rating of the user can befactored into the ratings provided for the identified documents.

The search result is essentially a URL, and hence, can be used as asecondary key for each rating. The URL may be normalized in order toensure future matching within search results where the URL may havetrivial differences. Supplementary search result information such asexcerpts from the search results, and crawl date may recorded and usedto determine whether the result page has changed since it was rated.Such information may be taken into account when applying previousratings to new searches. This information may also be used to present tothe user, a search result that may not be retrieved by the search engine20 in response to a future search query. In this regard, the originalsort rank of the search result prior to application of the userrating(s) may also be stored in the records database 50.

The user ratings of the obtained search results are incorporated intothe relevancy score of the documents identified as search results thatare provided in response to a search query, hence affecting the finalsort of the documents identified on the search results field 106.Ratings from the user may be given more influence over scores thanratings of the other users for the same query. Thus, when an identifieddocument is rated, its position in the listing displayed on the searchresult field 106 can change immediately.

The changing of the ranking position of the rated document may beattained by the user interface module 16 with, or without, specialgraphical effects. Such graphical effects may cause the particular rateddocument to fade out of its existing position in the listing of thesearch results field 106, and then move and/or fade into its newposition, with the user interface screen 100 scrolling as needed to keepin it in view of the user. Alternatively, the position of the rateddocument can simply appear in the new location on the listing displayedin the search results field 106 upon a subsequent search using a similaror the same query. In still another alternative, the user interfacemodule 16 may be adapted to allow the user 6 to directly manipulate thelocation of a document to a more appropriate position on the listingdisplayed on the search results field 106. This may be attained by clickand drag operation customarily accomplished using a mouse or otherpointing device.

The user interface module 16 may further be implemented to allow theuser to view rated documents of a search result only when they areavailable (either results that the user rated, or that all users rated,or both combined), thus providing a bookmarking type of feature. Inaddition, all of the documents identified through the search that areactually provided to the user 6 (i.e. which pages were served) arepreferably recorded and stored in the records database 50 to allow usein determining which documents that were not rated by the user, werestill possibly reviewed by the users, and which were not even looked at.

Furthermore, the search system 10 may be implemented with an “AdvancedUser” setting wherein a more elaborate interface screen is provided withvarious ancillary features and options and/or to allow the system morecomputation time. For example, in one implementation, a second submitbutton may be provided next to the submit button “Dig” 104, the secondsubmit button being marked “Dig Deeper” which informs the user thatselection of the second submit button provides a more advanced interfaceand options for performing the search. The above described interfacescreens and features may be implemented in such an advanced interface,or in the standard interface.

It should be apparent from the above discussion that the presentinvention provides a computer implemented search method forelectronically searching documents, and providing a search result inresponse to a search query. As described above, the search methodincludes receiving a search query from a user that includes at least onesearch query term, electronically identifying at least one alternativeterm for the at least one search query term, electronically displayingthe identified at least one alternative term for the user, andelectronically executing a search for electronic documents that satisfyat least one of the search query term and alternative term. Asexplained, the search is preferably executed based on the search queryterm and the alternative term simultaneously. The alternative term is asynonym of the search query term, and is preferably used in a same senseas the search query term is used in the search query.

As also explained, the search method may include unselecting thealternative term to indicate that the alternative term is not to besearched. In this regard, the at least one alternative term may be aplurality of alternative terms, and the search method includes searchingfor only a selected number of the plurality of alternative terms, anddisplaying the selected alternative terms for the user. Moreover, themethod may include selecting alternative terms to be searched, andunselecting alternative terms not to be searched. The search method alsoincludes displaying for the user, canonical sense, a domain category,and/or part-of-speech in which the at least one search query term ispresumed to be used. In addition, the method includes displayingalternatives that can be selected by the user.

Thus, it should be apparent from the above discussion that the presentinvention provides a search system and method in which, through a userinterface, provides a reactive environment for enabling human users togreatly facilitate expanding, correcting, and refining a search querythat has just been, or is about to be, submitted to the search engine ofthe search system. As can be appreciated from the description above, thesystem and method present invention increases recall while preserving,or even increasing precision. Importantly, the search system and methodof the present invention allows achieving better search results in ashorter amount of time than would likely be attained by manual searchquery expansion and refinement by the user.

In particular, the present invention greatly facilitates queryrefinement by proposing alternative search terms to the user in a waythat is convenient and effective. Such proposed alternative search termsoffer linguistically related, but not always contextually relevant,synonyms, which makes approximate, but not perfect, domain category andsense assumptions about a search query. The user can then interact viathe user interface as provided by the search system of the presentinvention to indicate appropriate alternative search terms andrespective senses that can be used by the search engine to retrieve thedesired search results.

While various embodiments in accordance with the present invention havebeen shown and described, it is understood that the invention is notlimited thereto. The present invention may be changed, modified andfurther applied by those skilled in the art. Therefore, this inventionis not limited to the detail shown and described previously, but alsoincludes all such changes and modifications.

1. A search system for searching for electronic documents, and providinga search result in response to a search query, comprising: a processor;a database, wherein terms in the database are grouped in a cluster basedon the terms belonging to a same word sense; a user interface moduleadapted to receive a search query from a user, said search query havingat least two search query terms; a query processing module thatautomatically determines a single, assumed word sense for one of thesearch query terms of the at least two search query terms selected froma list of retrieved senses for the search query term based on othersearch query terms of the at least two search query terms, wherein thequery processing module identifies a plurality of any availablealternative terms for the one of the search query terms from a clusterbased on the determined sense, wherein the alternative terms include anyavailable synonyms that are within the determined sense; a search enginethat electronically searches for electronic documents that satisfy saiddetermined sense of said at least one search query term; and whereinsaid user interface module is further adapted to display search resultsreceived from the search engine.
 2. The search system of claim 1,wherein said search engine executes a search based on said search queryterm and said plurality of any available alternative termssimultaneously.
 3. The search system of claim 2, wherein said userinterface only displays said search results that satisfy at least onesearch query term and ranks said search results based at least in parton whether said search results satisfy said any available alternativeterms.
 4. The search system of claim 1, wherein said processing moduleautomatically determines the single assumed word sense utilizing a partof speech tagger.
 5. The search system of claim 1, wherein saidprocessing module automatically determines the single assumed word senseutilizing a database of word senses linked to contextual words.
 6. Thesearch system of claim 1, wherein said processing module automaticallydetermines the single assumed word sense based whether the search queryterms overlap previous search queries for the word sense.
 7. The searchsystem of claim 1, wherein said cluster is a collection of words groupedtogether based on having similar usage patterns.
 8. The search system ofclaim 1, wherein documents that satisfy said determined sense of said atleast one search query term are determined by context words around theword hits.
 9. A computer implemented search method for electronicallysearching documents, and providing a search result in response to asearch query, comprising: receiving, using a user interface modulecontrolled by a computer processor, a search query from a user, saidsearch query having at least two search query terms; electronically andautomatically determining, using a query processing module controlled bya computer processor, a single, assumed sense of one term of the atleast two search query terms selected form a list of retrieved sensesfor the one search query term based on other search query terms of theat least two search query terms; electronically identifying, using thequery processing module controlled by a computer processor, a pluralityof available alternative terms from a cluster of terms belonging to asame word sense for the at least one search query term based on thedetermined sense, wherein the plurality of available alternative termsinclude any available synonyms within the determined sense; andelectronically executing a search, using a search engine, for electronicdocuments that satisfy at least the one of the at least two search queryterms and the plurality of available alternative terms.
 10. The searchmethod of claim 9, further including executing the search based on saidsearch query term and said plurality of available alternative termssimultaneously.
 11. The search method of claim 10, further includingdisplaying for the user, a part-of-speech in which said at least onesearch query term is presumed to be used.
 12. The search method of claim9, wherein said processing module automatically determines the singleassumed word sense utilizing a part of speech tagger.
 13. The searchmethod of claim 9, wherein said processing module automaticallydetermines the single assumed word sense utilizing a database of wordsenses linked to contextual words
 14. The search method of claim 9,wherein said processing module automatically determines the singleassumed word sense based whether the search query terms overlap previoussearch queries for the word sense.
 15. A computer readable hardwaremedium with executable instructions stored thereon, which when executedby a computer processor, cause said computer to execute a method forsearching for electronic documents, and providing a search result inresponse to a search query, the method comprising: receiving a searchquery from a user, said search query having at least two search queryterms; electronically and automatically determining a single, assumedsense of one term of the at least two search query terms selected form alist of retrieved senses for the one search query term based on othersearch query terms of the at least two search query terms;electronically identifying a plurality of available alternative termsfrom a cluster of terms belonging to a same word sense for the at leastone search query term based on the determined sense, wherein theplurality of available alternative terms include any available synonymswithin the determined sense; and electronically executing a search forelectronic documents that satisfy at least one of said search query termand the plurality of alternative terms.
 16. The computer readable mediumof claim 15, further comprising executing the search based on saidsearch query term and said plurality of available alternative termssimultaneously.
 17. The computer readable medium of claim 15, furthercomprising displaying for the user, a part-of-speech in which said atleast one search query term is presumed to be used.
 18. The computerreadable medium of claim 15, further comprising automaticallydetermining the single assumed word sense utilizing a part of speechtagger.
 19. The computer readable medium of claim 15, further comprisingautomatically determining the single assumed word sense utilizing adatabase of word senses linked to contextual words.
 20. The computerreadable medium of claim 15, further comprising automaticallydetermining the single assumed word sense based whether the search queryterms overlap previous search queries for the word sense.